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ABSTRACT 

The  robustness  (asymptotic  conservativeness)  of  Kruskal -Wallis  test 
under  certain  departures  from  mutual  independence  of  K  univariate  samples  is 
established.  This  robustness  provides  a  procedure  for  testing  the  equality  of  K 
marginal  distribution  functions  based  on  a  broken  random  sample  from  a  K-variate 
distribution  which  satisfies  a  mild  condition.  For  the  unbroken  sample,  a 
generalized  Kruskal-Wallis  test  is  proposed  for  testing  the  symmetry  of  a 
K-dimensional  distribution  function.  The  relative  efficiency  of  the  K-W  test 
against  the  aligned  rank  order  test  is  also  examined  under  the  normal  shift 
model , 
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1 .  INTRODUCTION 


The  problem  of  testing  symmetry  of  a  K-dimensional  distribution  function 
in  its  K  arguments  is  discussed  in  many  elementary  nonparametr ic  statistics 
books  (c.f.  Lehnann  (1975),  Hollander  &  Wolfe  (1973)).  One  version  of  this 
problem  is  testing  the  hypothesis  of  no  difference  among  K  treatments  in  a 
randomized  complete  block  design  with  random  block  effect  or  with  fixed  block 
effect  based  on  aligned  observations  (c.f.  Lehmann  (1975),  p.  270).  However, 
if  the  sample  is  broken  (c.f.  DeGroot,  Feder,  and  Goel  (1971)),  all  the  test 
procedures  for  testing  H^  are  no  longer  valid.  It  is  certainly  interesting 
to  see  if  anything  can  be  done  about  this  situation. 

More  specifically,  suppose  that  a  random  sample  (Y^ ^ ,... ,YRj) ',..., (Yln, ••• »Ygn 
of  size  n  is  drawn  from  a  K-variate  population  with  a  continuous  joint  distri¬ 
bution  function  G(y  ,...,y  )  and  marginal  distribution  functions  F  ,  i  =  . . K. 

1  In  i 

The  hypothesis  which  we  wish  to  test  is  H^:  =  • • •  =  F^.  It  Is 

assumed  that  the  distribution  function  G  is  symmetric  in  its  K  arguments  under 

Hq.  Now,  however,  suppose  that  before  the  sample  values  can  be  recorded,  each 

observation  vector  (Y  ,...,Y  )'  is  broken  into  K  separate  components  Y  , 

l  J  Ivj  lj 

i  »  1,...,K  and  the  observations  from  each  treatment  i  are  available  only  in 

the  form  Y,.  ,...,Y,,  ,  where  j,,...,j  is  some  unknown  permutation  of  l,...,n. 
ljl  ljn  1  n 

Since  the  blockings  of  Y  ,...,Y  (j  -  l,...,n)  in  the  original  sample  are 

lj  Kj 

not  known  at  the  data  analysis  time,  the  observed  values  are  called  a  broken 
random  sample  from  the  given  K-varlate  population.  This  unfortunate  situation 
is  not  uncommon  in  practice.  An  interesting  example  was  cited  by  Hollander, 

Pledger,  and  Lin  (1974). 

The  K-sample  Kruskal-Wallis  test  (Kruskal  (1952),  Kruskal  &  Wallis  (1952)) 
is  not  a  distribution  free  test  for  the  above  situation;  however,  in  this 
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article  we  shall  demonstrate  that  the  K-W  test  is  robust  (asymptotic  conservative) 
under  a  mild  condition  on  the  distribution  G.  That  is,  the  asymptotic  true 
level  of  significance  of  the  K-W  test  for  testing  based  on  the  broken  sample 

is  never  greater  than  its  preassigned  nominal  value,  provided  that  the  under¬ 
lying  joint  distribution  G  is  (pairwisely)  positively  quadrant  dependent,  i.e., 

P(Y±j  *  a,  Y±tj  *  b)  *  P(Yy  <  a)P(Yitj  <  b) , 

for  all  (a,b)  ’  e  R2;  i,i'  =  1,...,K;  i  i*  i  * ;  j  ■  l,...,n  (c.f.  Lehnann  (1966)). 
This  has  been  shown  to  be  a  relatively  weak  form  of  positive  dependence  (c.f. 

F.sary  &  Proschan  (1972)).  In  practice,  the  positive  dependence  would  be 
expected  among  observations  within  the  same  block  in  many  situations.  This  is 
especially  true  for  the  randomized  complete  block  design  since  subjects  in  the 
same  block  tend  to  be  homogeneous.  For  K  *  2,  the  K-W  test  is  equivalent  to 
the  two-sample  Wilcoxon  test  whose  asymptotic  conservativeness  was  studied  by 
Hollander,  Pledger  and  Lin  (1974).  Recently  Rinott  and  Poliak  (1978)  have 
established  the  robustness  of  a  class  of  test  statistics  which  essentially  are 
functionals  of  the  difference  between  the  empirical  distribution  functions  of 
two  positively  dependent  univariate  samples. 

In  the  second  half  of  this  article,  a  generalized  Kruskal-Wallls  test  is 
proposed  for  testing  based  on  the  unbroken  sample.  This  test  allows  all 
possible  interblock  comparisons  and  is  shown  to  be  asymptotically  distribution 
free.  Since  the  aligned  rank  order  test  also  utilizes  the  information  con¬ 
tained  in  the  interblock  comparisons  and  has  been  demonstrated  to  be  more 
efficient  than  the  Friedman  test  (c.f.  Hodges  &  Letmunn  (1962),  Mehra  & 

Sarangi  (1967),  and  Sen  (1968)),  the  asymptotic  efficiency  of  the  generalized 
K-W  test  with  respect  to  the  aligned  rank  order  test  is  examined  in  this  paper. 
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2.  ASYMPTOTIC  CONSERVATIVENESS  OF  KRUSKAL-WALLIS  TEST 


We  define  the  scoring  function  <f>  for  comparing  two  observations  Y 


U 


and  ^i.j.  by 


l*  Yij>Vj- 

*(Yij»Ti,j,)  *  5  _i»  Yij <  Yi'j' 

0,  otherwise. 


Let  p.^,  =  / (2F^ , (y)-l)dF^(y) ,  where  i  ?  i' .  Under  H^,  p  ,  *  0.  The  ith 

component,  W  ,  of  the  vector  score  statistic  W  is  defined  to  be  the  total 
in  ~n 

score  comparing  the  ith  sample  with  the  ramining  K-l  samples,  i.e., 

K  n  n 

wln=  I  I  I  <*<Yi1'Y1'1')-pii') 

i  *1  j=*l  j'-l  LJ  1  J  11 


V  +  i 


and 


<%. . V- 

-3/2 

Theorem  1 .  As  n  +  ”,  n  converges  in  distribution  to  a  K -dimensional 

normal  random  vector  with  mean  0  and  covariance  matrix  j  ■  (o^,),  where 

K  K 

m  ii  new i(  Y  fv  ( V  ^ V  ( V  ^ ^ 

ii 

mjM  W i* 

1  *  i,i*  <  K. 

n  n 


■  4  “V(J,  J,  <rt<Ti'i>"Fr(Yn)“’Vf)) 


Proof .  Let 


"ll'.n  ’  "'3/2  ^  j.1,  <*aiJ.Ti,J,)-p,1,). 


4>iii<y)  “  E<|>(y,Yi,J,)-plll  -  2Fit(y)  -  1  -  p^.,  and 
♦ii.W  -  -  Piit  -  1  -  2F1(y)  -  p^,. 
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Also,  let 


g(a,b)  =  <p(a,b)  -  pii(  -  <j>°it(a)  -  and 

V',,  ■  "'l/!  [j,  <*u-<v  + 

'  2n'l/2  J,  °v(V  ‘  w? ' 


U'.n  -V,n)2-”'3MJlJi1  g<YU'Yfl'; 


n  n  n  n 


l  l  l  l  MJ.j'.k.k'), 
j=*l  j*-l  k-1  k'-l 


(2.1) 


where  h(j,j\k,k')  *  ECg^^  .Y^  ^  ^B^^.Y^,)  ]  •  Because  g  is  bounded,  we 

3 

can  ignore  any  u  terms  of  the  sum  in  (2.1)  if  u  is  of  order  o(n  ).  Consider 

the  following  cases  for  which  the  number  of  terms  is  with  order  larger  than  or 
3 

equal  to  0(n  )  (where  j,j',k,  and  k’  represent  four  distinct  indices): 


(1)  h(j,j\k,k')  -  E8(Yij*Y1'j^E8CYik.Y1,k,)  -  0; 

(2)  h(j,j,k,k’)  =  h(j,j’,k,k)  *  0; 

(3)  h(j,j\j,k’)  =  h(j  ,j  ’  ,k,j  *) 

-  E{Etg(Ylj  ,Y1,i  ,)g(Yij  ,Y±  ,k,)  |Yy  ]} 

-  E{E[g(Y1J.Yifr)|Yij]E[g(Yij.Yi,kf)|YiJ]}  -  0; 

(A)  h(j,J’,k,j)  -  h(J,j\j\k’)  -  EECB(Ylj,Yi,j,)g(Ylk,Yi,j)|Y1J,Y1,j] 

-  E{E(g(Ylj,Ylljf)|Ylj,Yitj]ECg(Ylk,Yi,j)|Ylj,Yi,J3)  -  0. 
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*  2 

It  follows  that  E(U  ,  -  U  .  )  +  0,  as  n  +®.  By  Corollary  6  of  Lehmann 

lx  *n  li  ,n 

* 

(1975,  p.  289)  and  the  fact  that  the  vector  <U^t  n>  has  a  K(K-l)/2-dimensional 
normal  limiting  distribution  N(0,A),  where  A  =  an^ 

W'  '  4«Ff(Yc1>  -  VW  -  Ptt-)(Fk'<V-VVl>-'W>’ 

1  <  f,£',k,k'  ^  K;  £  4  £' ;  k  |f  k1,  the  vector  <U  ,  >  has  the  same  multi- 

ii  ,n 

-3/2 

variate  normal  limiting  distribution  N(0,A).  Therefore,  n  has  a 

multivariate  normal  limiting  distribution  with  mean  0  and  covariance  matrix 
J  as  is  stated  in  the  Theorem. 

Corollary  1.  If  the  joint  distribution  G  of  (Y^  , . . .  ,Y^ ) *  is  symmetric  in 
its  K  arguments  and  F  is  the  common  marginal  distribution  function,  then 


o2  n 


n  o 


n  n 


K  *  K, 


where  =  4K(K-1) (1/12-5) ,  H  -  -4K(l/12-5),  and  K  -  cov(F(Y  ) ,  FtY^  )). 


Proof .  Let  V . 


I\ 

(K-l)F(Y  .)  -  J  F(Y„  ).  Then,  it  can  be  shown  that 

ij  it  l  l3 


0*4  var(Vtj)  -  4K(K-1) (1/12-0  and  n  -  4  covO^.V^  ^)  "  -**(1/12-0. 


2 

The  latent  roots  of  J  are  (o  +  (K-l)n)  ■  0  with  the  latent  vector 

q„  -  —  (1,1,...,!)'  and  (o2  -  n)  of  multiplicity  (K-l)  with  (K-l)  orthonormal 

latent  (any  set  of)  vectors  orthogonal  to  —  (1,1,...,!)',  q, , ...,q_  ,,  say 

/k  ~l  ~K-1 

(c.f.  Rao  (1973),  p.  67).  In  practice,  the  q  are  perhaps  most  easily  found 
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by  means  of  the  Gram-Schmidt  orthogonal izat ion  process.  Since  J  is  symmetric, 
therefore,  Q'Jq  =  ARXR,  where  Q  =  (q  ,g2 . qR) ,  ARxR  -  diag(X . X,0)  and 


X  =  o'"  -  n.  It  follows  that  if  G  is  symmetric  in  its  K  arguments. 


,  K-l  , 

s  =  n~J  x  y  cq'  w r 

n  -»i  ~n 


(2.2) 


is  asymptotically  distributed  as  a  chi-squared  distribution  with  K-l  degrees 

of  freedom.  Note  that  if,  for  each  j,  1  S  j  <  n,  Y.  , . . .  ,Y  are  mutually 

1  j  K.J 

2 

independent,  then  X  =  K  /3  and  S  in  (2.2)  becomes  the  usual  K-sample  Kruskal- 

n 

Wallis  test  statistic.  The  next  result  gives  the  asymp‘ otic  conservativeness 
of  the  K-W  test  for  testing  under  a  certain  dependent  model. 

Corollary  2.  If  G  is  symmetric  in  its  K  arguments  and  (pairwisely)  positively 
quadrant  dependent,  then 

K-l 

lim  P(3n  3  K  2  £  [q'  W  ]2  >  X?  >  s  a>  where  y2 

n-*«°  i-1  ~n  1-0  1_a 

is  the  (l-a)th  quantile  of  a  chi-square  random  variable  with  K-I  degrees  of 
freedom. 


Proof .  Since  G  is  (pairwisely)  positively  quadrant  dependent,  it  is  easily 
seen  that  5  =  cov(F(Y1j)  ,F(Ylt  ))  £  0.  Therefore,  X  -  4R2(l/12-5)  s  R2/j. 


3.  A  GENERALIZED  KRUSKAL-WALLIS  TEST  AND  ITS  EFFICIENCY 

If  the  sample  (Y. ,Y„, )',..., (Y,  , ...,Y„  )'  is  not  broken,  the 
IX  KX  in  Ml 

parameter  X  in  (2.2)  can  be  consistently  estimated  from  the  data.  One  such 

A 

consistent  estimator  X,  say,  for  the  case  G  is  symmetric  in  its  K  arguments, 
is  K2 ( 1/3  -  45),  where 


£  =  /( Fn(x)  -  l/2)(Fn(y)  -  l/2)dHn(x,y) , 


K  n 


F  (x)  -  (nK)  V  7  I(Y  <  x), 
n  ij  — 

i=*  1  j  - 1 

H  (x,y)  =  2(nK(K-l))  l  1  V  I(Y  <  x,Y  <  y) , 
n  i*i'  J-l  lj  “  1  J  - 

and  1(A)  is  the  indicator  function  of  the  event  A.  The  consistency  of  the 

A  A  3. S • 

estimator  £  can  easily  be  obtained  through  the  fact  that  sup[F  (x)-F(x)|  — >  0, 

^  a.  s.  x  n-w» 

sup  | H  (x,y)  -  H(x,y)|  _>  0,  and  the  continuity  of  F,  where  H(x,y)  is 

(x,y)  n  !»■**> 

the  joint  distribution  function  of  Y  ,  and  Y  ,  ,  1  <  i  I1  i'  <  K,  under  the 


and  Y  ,l<if*i'  <K,  under  the 
U  i  J  V_1 

A  _2  A  1  2 

ows  that  the  test,  S  -  n  X  >  (q'W  )  , 


condition  of  symmetry  of  G.  It  follows  that  the  test,  ■*  n  X  £  (q'^tO  , 
which  allows  interblock  comparisons,  is  an  asymptotically  distribution  free 
test  for  testing  the  symmetry  of  the  joint  distribution  function  G. 

It  is  certainly  interesting  to  examine  the  relative  efficiency  of  the 

A 

gernalized  K-W  test  based  on  S  with  respect  to  the  powerful  aligned  rank 

n 

order  test.  For  simplicity,  we  only  consider  the  case  K  =  2.  In  this  situation, 

the  aligned  rank  order  test  is  asymptotically  equivalent  to  the  signed-rank 

Wilcoxon  test  (c.f.  Lehmann  (1975),  p.  278).  Under  the  normal  shift  model, 

i.e.,  (Y ^  j  * Y2 j ^  *  is  a  bivariate  normal  random  vector  with  mean  (y40,y)  and 

covariance  matrix  (*’*?),  the  Pitman  efficiency  of  the  signed-rank  Wilcoxon 
P.1 
1/2 

test  is  (3/(Air(l-p)))  (c.f.  Lehmann  (1975),  p.  374)  for  testing 


H^:  0-0  against  H j :  0  >  0. 


(3.1) 


When  K  -  2,  the  statistic  S  is  equivalent  to  W  .  By  Theorem  1,  n  "3/2w,  has 

n  in  in 

a  normal  limiting  distribution  under  and  Hj  and  satisfies  four  conditions 
of  Lehmann  ((1975),  p.  371-372).  For  the  above  normal  shift  model,  the  Pitman 

a  1 

efficiency  of  the  test  S  for  testing  (3.1)  can  be  shown  to  be  [3/(4tt(1-12£)) ] 

n 


i 


Since  £ 
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The  robustness  (asymptotic  conservativeness)  of  Kruskal-Wallis  test  under 
certain  departures  from  mutual  independence  of  K  univariate  samples  is  estab¬ 
lished.  This  robustness  provides  a  procedure  for  testing  the  equlity  of  K 
marignal  distribution  functions  on  a  broken  random  sample  from  a  K-variate 
distribution  which  satisfies  a  mild  condition.  For  the  unbroken,  sample,  a 
generalized  Kruskal-Wallia  test  is  proposed  for  testing  the  symmetry  of  a 
K-dimensional  distribution  function.  The  relative  efficiency  of  the  K-W  teat 
1  Wed  tank  order  test  is  also  examined  under  the  normal  shift  *"* 
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